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AMENDMENTS TO THE SPECIFICATION 

Please amend the paragraphs f006], [001 1], [0014], [0018J. [0019], 10022]. [0023], [0027], 
[0028]. [0029], [0030], [0031], [0032], [0035], [0038], [0043], [0045], [0050], [0055] and Tables 1 to 4 
as follows: 

[006] In one embodiment, a composition is provided that comprises a protein in crystalline form 
wherein the protein has 65%, 70%, 80%, 90%, 95% or greater identity with residues 3 9 766 of SEQ. ID 
Ner-1- 13-740 of SEP ID NQ:3 . 

[001 1] In one embodiment, a method is provided for forming crystals of a protein comprising: forming a 
crystal I i/at ion volume comprising: a protein that has at least 65%. 70%, 80%. 90%, 95% identity' with 
residues 39 - 766 of SEQ; 11) No. 4 13-740 of SEQ ID NO:3 in a concentration between 1 mg/ml and 50 
mg/rril: 5-50% wv of precipitant wherein the precipitant comprises one or more members of the group 
consisting of PEG MME having a molecular weight range between 300-10000, and PEG having a 
molecular weight range between 100-10000; optionally 0.05 to 0.8M additives wherein the additives 
comprises sarcosine or 0.5 to 25% additives wherein the additives comprises xylitrol; and wherein the 
crystallization volume has a pH between pH 5 and pH 9; and storing the crystallization volume under 
conditions suitable for crystal formation. The method optionally further comprises using 0.05-0. 2M 
buffers selected from the group consisting of Tris-HCl, bicine and combinations thereof. The method 
also optionally further includes performing the crystallization at a temperature between S°C - 25°C. 

[0014] In one embodiment, machine readable data storage medium is provided having data storage 
material encoded with machine readable data, the machine readable data comprising: structure 
coordinates that have a root mean square deviation of alpha-carbon atoms of less than 3A when 
superimposed on alpha-carbon atoms positions of corresponding atomic coordinates of Figure 3, the root 
mean square dev iation being calculated based only on those alpha-carbon atoms of amino acid residues in 
the structure coordinates That are also present in residues lw "wt-^; \4 * » H) V. t >3 to ol SI O ID 
NO:3 . 

[0018] In one embodiment, a method is provided for displaying a three dimensional representation of a 
structure of a protein comprising: taking machine readable data comprising structure coordinates that 
have a root mean square deviation of alpha-carbon atoms of less than 3 A when superimposed on alpha- 
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carbon atom positions of corresponding atomic coordinates of Figure 3, the root mean square deviation 
being calculated based only on those alpha-carbon atoms of amino acid residues in the structure 
coordinates that are also present in residues shown in Tables 1, 2, 3 and/or 4 or residues 3 9 -766 of SEQv 

11) No. 1 1 5-" fi LMjfJ p_N i ? „,imp iti u a t. ree d mcr^H nal represent tt on oi a structure based on 

the structure coordinates: and displaying the three dimensional representation. 

[0019] in another embodiment, a method is provided for displaying a three dimensional representation of 
a structure of a protein comprising: displaying a computer model for a protein binding pocket, at least a 
portion of the computer model having a surface contour that has a root mean square deviation of less than 
3A when superimposed on a surface contour defined by atomic coordinates of Figure 3, the root mean 
square deviation being calculated based only on alpha-carbon atoms in the structure coordinates of Figure 
3 that are present in residues shown in Tables 1, 2, 3 and/or 4 or residues 39—66 of 4D No. 1 13- 
740 of SEP ID NO:3 . 

[0022] In one embodiment, a computational method is provided comprising: taking machine readable 
data comprising structure coordinates that have a root mean square deviation of alpha-carbon atoms of 
less than 3A when superimposed on alpha-carbon atom positions of corresponding atomic coordinates of 
Figure 3, the root mean square deviation being calculated based only on those alpha-carbon atoms of 
amino acid residues in the structure coordinates that are also present in residues shown in Tables 1,2,3 
and/or 4 or residues 39 766 o f SEQ. ID No. 1 1 3- 74 0 of SEQ ID NO:3; computing phases based on the 
structural coordinates; computing an electron density map based on the computed phases; and 
determining a three-dimensional crystal structure based on the computed electron density map. 

[0023] In another embodiment, a computational method is provided comprising: taking an X-ray 
diffraction pattern of a crystal of the target protein; and computing a three-dimensional electron density 
map from the X-ray diffraction pattern by molecular replacement, wherein structure coordinates used as a 
molecular replacement model comprise structure coordinates that have a root mean square deviation of 
alpha-carbon atoms of less than 3 A when superimposed on alpha-carbon atom positions of corresponding 
atomic coordinates of Figure 3. the root mean square deviation being calculated based only on those 
alpha-carbon atoms of amino acid residues in the structure coordinates that are also present in residues 
shown in Tables 1, 2, 3 and/or 4 or residues 39 766 of SEQ. ID No. i 13-740 of S EP I D NO:3 . This 
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method may optionally further comprise determining a three-dimensional crystal structure based upon the 
computed three-dimensional electron density map, 

[0027] In one embodiment, a method is provided for evaluating a potential of an emits to associate with 
a protein comprising: creating a computer model of a protein structure using structure coordinates that 
comprise structure coordinates that have a root mean square deviation of alpha-carbon atoms of less than 
3 A when superimposed on alpha-carbon atom positions of corresponding atomic coordinates of Figure 3. 
the root mean square deviation being calculated based only on those alpha-carbon atoms of amino acid 
residues in the structure coordinates that are also present in residues shown in Tables 1,2,3 and/or 4 or 
residues 39 766 ofSEQ. IP Nu. 1 13-740 of SEP ID NO: 3. 1; performing a fitting operation between the 
entity and the computer model; and analyzing results of the fitting operation to quantify an association 
between the entity and the model. 

[00281 In another embodiment, a method is provided for evaluating a potential of an entity to associate 
with a protein comprising: computing a computer model for a protein binding pocket, at least a portion of 
the computer model having a surface contour that has a root mean square deviation of less than 3 A when 
superimposed on a surface contour defined by atomic coordinates of Figure 3, the root mean square 
deviation being calculated based only on alpha-carbon atoms in the structure coordinates that are present 
in residues shown in Tables I, 2, 3 and/or 4 or residues 39 - 766 of -SE Q. ID No. 1 1 3-740 of SEP ID 
NQ:3 ; evaluating a potential of an entity to associate with the surface contour by performing a fitting 
operation between the entity and the surface contour; and analyzing results of the fitting operation to 
quantify an association between the entity and the computer model, 

[0029] hi another embodiment, a method is provided for identifying entities that can associate with a 
protein comprising: generating a three-dimensional structure of a protein using structure coordinates that 
comprise structure coordinates that have a root mean square deviation of alpha-carbon atoms of less than 
3 A when superimposed on alpha-carbon atom positions of corresponding atomic coordinates of Figure 3, 
the root mean square deviation being calculated based only on those alpha-carbon atoms of amino acid 
residues in the structure coordinates that are also present in residues shown in Tables 1, 2, 3 and/or 4 or 
residues 39 766 o f S EP^I&^-l- 13-740 of SEP ID NO:3: employing the three-dimensional structure to 
design or select an entity that can associate with the protein; and contacting the entity with a protein 
having at least 65% identity with residues 39-?66-^f-SEp. IP N o. 1 13-740 of SEP I D NO:3 . 
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[0030] In another embodiment, a method is provided for identifying entities that can associate with a 
protein comprising: computing a computer model for a protein binding pocket, at least a portion of the 
computer model having a surface contour that has a root mean square deviation of less than 3A when 
superimposed on a surface contour defined by atomic coordinates of Figure 3, the root mean square 
deviation being calculated based only on alpha-carbon atoms in the structure coordinates that are present 
in residues shown in Tables 1, 2, 3 and/or 4 or residues 39 766 of SE Q4D No. 1 13-740 of SEP ID 
NO:.}:, employing the computer model lo design or select an entity that can associate with the protein; and 
contacting the entity with a protein having at least 65%, 70, 80, 90, 95% identity with residues 39-~66 of 
SEQ. I D No. 1 13-740 of SEQ ID NO:3 . 

[0031] In another embodiment, a method is provided for evaluating the ability of an entity to associate 
with a protein, the method comprising: constructing a computer model defined by structure coordinates 
that comprise structure coordinates that have a root mean square deviation of alpha-carbon atoms of less 
than 3A when superimposed on alpha-carbon atom positions of corresponding atomic coordinates of 
Figure 3, the root mean square deviation being calculated based only on those alpha-carbon atoms of 
amino acid residues in the structure coordinates that are also present in residues shown in Tables 1, 2, 3 
and/or 4 or residues 39 766 of SEQ. ID No. I 13-740 of SEP IDNP:3 ; selecting an entity to be evaluated 
by a method selected from the group consisting of (i) assembling molecular fragments into the entity, (ii) 
selecting an entity from a small molecule database, (iii) de novo ligand design of the entity, and (iv) 
modifying a known ligand for DPP1V, or a portion thereof; performing a fitting program operation 
between computer models of the entity to be evaluated and the binding pocket in order to provide an 
energy-minimized configuration of the entity in the binding pocket; and evaluating the results of the 
fitting operation to quantity the association between the entity and the binding pocket model in order to 
evaluate the ability of the entity to associate w it h the binding pocket. 

[0032J In another embodiment, a method for evaluating the ability of an entity to associate with a 
protein, the method comprising: computing a computer model for a protein binding pocket, at least a 
portion of the computer model ha\ ing a surface contour that has a root mean square deviation of less than 
3 A when superimposed on a surface contour defined by atomic coordinates of Figure 3. the root mean 
square deviation being calculated based only on alpha-carbon atoms in the structure coordinates that are 
present in residues shown in Tables 1 , 2, 3 and/or 4 or residues 39 ' 766 of SEQ. ID No. I 1 3-740 of SEQ 
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ID_N03; selecting an entity to be evaluated by a method selected from the group consisting of (i) 
assembling molecular fragments into the entity, (ii) selecting an entity from a small molecule database, 
(Hi) de novo ligand design of the entity, and (iv) modifying a known ligand for an DPPIV, or a portion 
thereof: performing a fitting program operation between computer models of the entity to be evaluated 
and the binding pocket in order to provide an energy-minimized configuration of the entity in the binding 
pocket; and evaluating the results of the fitting operation to quantify the association between the entity 
and the binding pocket model in order to evaluate the ability of the entity to associate with the said 
binding pocket, 

[0035] In another embodiment, a method is provided for identifying an entity that associates with a 
protein comprising: taking structure coordinates from diffraction data obtained from a crystal of a protein 
that has at least 65%, 70%, 80%, 90%, 95% or more identity with the residues 39 766 of SEQ. ID No. \ 
13-7 -10 of SF.Q ID NO: 3 : and performing rational drug design using a three dimensional structure that is 
based on the obtained structure coordinates. The protein crystals may optionally have a crystal lattice 
having unit cell dimensions, +/- 5%, of a=121.53A b= 124.1 1 A and c=144.42A, ot=f=90°, (3=1 14.6°. The 
method may optionally further comprise selecting one or more entities based on the rational drug design 
and contacting the selected entities with the protein. The method may also optionally further comprise 
measuring an activity of the protein when contacted with the one or more entities. The method also may 
optionally further comprise comparing activity of the protein in a presence of and in the absence of the 
one or more entities; and selecting entities where activity of the protein changes depending whether a 
particular entity is present. The method also may optionally further comprise contacting cells expressing 
the protein with the one or more entities and detecting a change in a phenotype of the cells when a 
particular entity is present. 

[0038] Figure 3 lists a set of atomic structure coordinates for DPPIV (SEQ ID NO :3) as derived by X-ray 
crystallography from a crystal thai comprises the protein. The following abbreviations are used in Figure 
3: "X. Y. 7" erystallogruphicalh define the atomic position of the clement measured; "B"' is a thermal 
factor that measures movement of the atom around its atomic center: "'Oec'" is an occupancy factor that 
refers to the fraction of the molecules in which each atom occupies the position specified by the 
coordinates (a value of "1" indicates that each atom has the same conformation, i.e., the same position, in 
all molecules of the crystal). ''NAG" stands for N-Acetylglucosamine. 
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[0043] The present invention relates to a member of the S9 family of human proteases known as 
dipeptidyi peptidase IV (DPPIV) (SEP ID NO:l) . More specifically, the present invention relates to 
DPPIV in crystalline form, methods of forming crystals comprising DPP1V, methods of using crystals 
comprising DPPIV, structure coordinates and a crystal structure of DPPIV, and methods of using the 
structure coordinates and crystal structure. 

10045] Dipeptidyi Peptidase IV (DPPIV) (SEP ID NP:i) is a serine protease of Clan SC family S9. 
DPPIV is a 240kDa homodimeric, multi-functional type- II membrane bound glycoprotein, widely 
distributed in all mammalian tissues, but highly expressed in kidney, liver and endothelium. DPPIV is 
also known as DPP4. CD26, adenosine deaminase completing protein 2 or adenosine deaminase binding 
protein (ADAbp). DPPIV consists of a short cytoplasmic domain of six amino acids, followed by a 
hydrophobic transmembrane domain {amino acids 7-28) and an extracellular sequence of 739 amino 
acids. 

[0050] In another embodiment, DPPIV comprises residues 39 766 of SEQ. ID No. 1 13-740 of SEQ ID 
NO: 3 which comprises the active site domain of wild-type DPPIV that is represented in the set of 
structural coordinates shown in Figure 3. 

[0055] One or more of the sets of amino acids set forth in the tables is preferably conserved in a variant 
of DPPIV. Hence, DPPIV may optionally comprise a sequence that has at least 65% identity, preferably 
at least 70%, 80%, 90%, 95% or higher identity with any one of the above sequences (e.g., all of SEQ. ID 
Ne^+ SEP ID NO:3 or residues 39 766 of SEQ. ID No. I 13-740 of SEP ID NO: 3) where at least the 
residues shown in Tables 1, 2, 3 and/or 4 are conserved with the exception of 0, I, 2, 3, or 4 residues. It 
should be recognized that one might optionally vary some of the binding site residues in order to 
determine the effect such changes have on structure or activity. 

Table 1: Amino Acids encompassed by a 4-Angstrom radius around the DPPIV active site 
(SEQ ID NO:3). 



\RG [[111]] 99 


TYR [[533]] 521 


TYR [[652]] 640 


GLU [[191]] 179 


SER [[616]] 604 


ASN [[696]] 684 
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GLU [[192]] 180 


TYR [[6171] 605 


HIS [[726]] 714 


^^ERjfiyS]] 183 


VAL [[642]] 630 


ASP [[694]] 682 


PHI 1 *4 [1 331 


TYR [[648]] 636 





Table 2: Amino Acids encompassed by a 7-Angstrom radius around the DPPiV active site 
(SEP ID NO:3) . 



ARG [ 1 1 1 1 ; | 99 


TYR [[533]] 521 


TRP Q645 1; 635 


HIS [[112]] 100 


GLY f[535]] 523 


TYR [[648JJ 


TRP ff 187]] 175 


PRO [[ 536 1] 524 


ASP [[649]] 637 


GLU [[390]] 1 8 


TYR [[571]] 559 


TYR [[652]] 640 


GLU [[191]] 179 
GLU ]]192]] 180 


TRP [[61 5]] 603_ 
SER || 616|| 604 


THR|[ 653 1| 641 
\RG ||655 | '.4" 


VAL I] 193] | 181 
' I'lll ||l»4|j 182 
SF.R || 195|1 183 


TYR [[617]] 605 


TYR [[656]] 644 


GLY [[618]] 606 


ASN [[69oi] 684 


TYR [[620]] 608 


VAL [[697]] 685 


ARG [[342]] 330 
PHE [[343!] 331 


ALA [[640]] 628 


HIS |[726|i 714 


PRO [[641)1 629 


ASF [1 694]] 687 


ARG [[344]] 332 


VAL [[642]] 630 





Table 3: Amino Acids encompassed by a 10-Angstrom radius around the DPPIV active site 
(SEP ID NO:3) . 



ARG [[111]1 99 


ILE [[391]] 379 


SER [[643]] 631 


HIS [[112]] 100 


VAL [[532]] 520 


ARG || b44 1| 632 
TRP [[645]] 633 


TRP [[187]] 175 


TYR [[533]] 521 


VAL [[188]] 176 


ALA [[534]] 522 


TYR [j 647]] 635 


TYR [[189]] 177 


GLY [[535]] 523 


TYR |] 648 |i 636 


GLU [[190]] 178 


PRO [[536]] 524 


\SP [[649]] 637 
SER [[650 1] 638 


GLU || 191 1| 179 


CYS [[537]] 525 


GLU [[192]] 180 


SER [[538]] 526 


VAL |1651]] 639 
TYR [| 652]] 640 


VAL [[193]] 181 


TYR [[571]] 559 


Pill: [[194]] 182 


MET [[577|1 565 


1HR || b53\\ 641 " 


SLR [[195]] 183 


LEU [[584]] 572 


GLU [[654]] 642 
ARG [[655]] 643 


ALA [[196]] 184 


GLU [[588]] 576 


TYR [[242]] 230 


GLY [[614]] 602 


TYR [[656]] 644 


CYS i [287]J 275 


TRP [[615]] 603 


ML) [[65" || 645 


GLN ||306]] 294 
TRP [[339]] 327 


SER [[616]] 604 
TYR [[ 61 7] j 605 ~ 


HIS [[690]] 678 
ASP [[694]] 682 


VAL |[340]] 328 


GLY [[618]] 606 


ASP [[695]] 683 


GLY [[341]] 329 


GLY [[619]] 607 


ASN [[696]] 684 ; 


ARG [[342]] 330 


TYR [[620]] 608 


VAL [[697]] 685 | 


PHE [[343]] 331 


VAL [[621]] 609 


HIS [[698]] 686 j 
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ARG [[344]] 332 


VAL [[639]] 627 


GLN [[701]] 689 


PRO [[345]] 333 


ALA [[640]] 628 


HIS [[726]] 714 


SER [[3461] 334 


PRO [[641]] 629 


GLY [[72711 715 


GLU [[34731 335 


VAL [[642]] 630 





Table 4: Amino Acids encompassed by a 5-Angstrom radius around the AR and CD 
dimerization interfaces (SEP ID NO:3) . 



1 Chain A 


Chain B 


Chain C 
PROCff [220H 208" 


Chain D 


SER A [[225]] 213 


PRO B 1122011 208 


I I I 1.) |]221 j] 209 


LYRA [f227]]2S5 


ILL B 112227 210 


LEU C ([221]] 209 


ILL D |[222]j 210 


SLR \ [228 1 i 


GLU B [[2231] 211 


ILE C [[22211 210 


GLU D [[223]] 2J1 


ASP A [[229]] 2.12 


TYR B [[224]] 2J2 


GLU C [[223]] 211 


TYR D [[224]] 212 


GLU A [[230]] 218 


SER B 1122511 213 


TYR C [[224]] 2L2 


SLR D| [225}] 213 


1 11 A [[232JJ 220 


TYR B [[2271] 2 1 5 


TYR C [[227]] 215 


SLR D||228J]216 


GLN A [[233]] 221 


SER B [[228]] 216 


SLR( ]]?. >i 246 


ASP D [[229]] 217 


TYR A [[234]] 222 


ASP B [[229]1 2L7 


ASP C [[2291] 217 


GLU D [123011 218 


PRO A f[235"|] 223 


GLU B 1123011 218 


GLU C [[230]] 218 


SER D [[23111 219 


LYS A [[236]] 224 


SERB [[2311] 219 


SERC [[231H 219 


LEU D [[232H 220 


THR A [[237]] 225 


TYR B [[234]] 222 


LEU C [[232]] 220 


GLN D [[233]] 221 


ARG A [[239]] 227 


THR B [[23711 225 


GLN Cf [23 3]] 221 


TY R D [[234]] 222 


TYR A [[242]] 230 


ARG B [[239]] 227 


TYR C [[234]] 222 


PROD [[235]] 223 


LYS A [[244]] 232 


GLN B [[700]] 688 


PRO C [[235]] 223 
THR C [[237]] 225, 
TYR C [[242]] 230 


THR 1) [[237]] 225 


ALA A [[245]] 233 


ALA B [[703]] 691 


ARG D [[239]] 227 


SERA [[706]] 694 


GLN B [[704]] 692 


LYS D [f244]l 232 


LYS A [[707]] 695 


'LYS B [[707]] 695 


PRO C [[243]] 231 


ALA D [[24531 233 


LEU A [[709]] 697 


LEU B [[709]] 697 
VAL B [[7 in"]] 698 


LYS C [[244]] 232 


Al A D [[24711 235 


VAL A [[71031 698 


GLU C [[64611 634 


TYR D [[647]] 635 


i ASP A [[71 1]] 6299 


ASP B [[711]] 699 


THR C [[67311 66J_ 


MET D [1675 1] 663 
HIS D [[690]] 67.8 


GLY A [[713]] 70J_ 


GLY B [[713j| 701 


LEU C [[688]] 676 


VAL A [[714]] <02 
PHEA||?16!1>4 
GLN A [[71 "Ti "05 


VAL B 1171411 702 
ASP B [[715]] 703 
PI IF. B [[716]] 704 


PHE C [[699] 1 687 
GLN C f[700H 688 
SERC [[70211 690 


GLN D || 700 1] 688 
SLR D 1 1 702]) 690 
GI N D {[ "704)1 692 


ML J A [[7 19]] 70- 


GLN B [[717]] 705 


GLN C [[704]] 692 


SLR D || 70611 694 j 


IRP V 1C20H 708 


ALA B [[718]] 706 


LEUC [pj9j] ou- 


ASP D in 1 ■! w,.) 


TYR A [172.1 || 709 




VALC [[7103] 698 


VAL D [[71-1]] ^02 


11 IRA [[7221] "10 




ASP C [[7 11]] 699. 


ASPDfflv. 'ui 


ASP A [[723]] 7J_L 




GLY C [[713]] 701 


PHE D [[716 ; 04 






VAL C [[714]] 702 


GLN I) [[' H2<j 






ASP C [[715]] 703 


\LA D [171811 706 | 






PHE C [[716]] 704 


MET D HE 






GLN C [[717]] 705 


TRP D [[72i 1 ^ { 






ALA C [[71811 706 


TYR D [[72!]] 709 







MET C [[719]] 707 


THRD[|722]] "10 






TRP C [[720]] 7_0S 








TYR C !p2!j| 709 
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